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ABSTRACT 

We present a new analytic calculation for the redshift-space evolution of the 1-point 
galaxy Probability Distribution Function (PDF) . The nonlinear evolution of the mat- 
ter density field is treated by second-order Eulerian perturbation theory and trans- 
formed to the galaxy density field via a second-order local biasing scheme. We then 
transform the galaxy density field to redshift space, again to second order. Our method 
uses an exact statistical treatment based on the Chapman-Kolmogorov equation to 
propagate the probability distribution of the initial mass field to the final redshifted 
galaxy density field. We derive the moment generating function of the PDF and use 
it to find a new, closed-form expression for the skewness of the redshifted galaxy dis- 
tribution. We show that our formalism is general enough to allow a non-deterministic 
(or stochastic) biasing prescription. We demonstrate the dependence of the redshift 
space PDF on cosmological and biasing parameters. Our results are compared with 
existing models for the PDF in redshift space and with the results of biased N-body 
simulations. We find that our PDF accurately models the redshift space evolution and 
the nonlinear biasing. 
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1 INTRODUCTION 



One of the central goals of modern cosmology is to gain 
a quantitative understanding of the nature of the observed 
galaxy distribution. In order that such an understanding be 
complete it must incorporate an explanation of both the 
evolution of large-scale structure in the universe and of the 
relationship between galaxies and the underlying dark mass. 

In the standard paradigm the origin of structure can 
be traced back to small perturbations in the mass density 
imprinted during an inflationary phase. These perturbations 
subsequently grow in amplitude through gravitational insta- 
bility. This prescription is well tested in N-body simulations 
and can be used to make useful predictions about the clus- 
tering history of dark matter. Many of the details, however, 
are not well understood. 

The picture becomes more complex when one considers 
the galaxy distribution. Firstly, the astrophysical processes 
that govern the formation of the galaxies themselves intro- 
duce a nontrivial relationship between the galaxies and the 
dark matter. This relationship is generally termed galaxy 
bias. The biasing relation may prove to be simple to de- 
scribe, depending only upon the local density in dark mat- 
ter at a given point (eg Coles 1993). Conversely it could 
be extremely complex, depending on the larger scale en- 



vironment, tidal fields, feedback and entire history of the 
galaxy formation site. This could lead to non-local (Bower 
et al. 1993) or non-deterministic forms of biasing (Pen 1998, 
Dekel & Lahav 1999). Secondly, the observed galaxy distri- 
bution is distorted in 3D redshift catalogues due to the line 
of sight components of the galaxy peculiar motions adding to 
the general Hubble expansion (Kaiser 1987). These redshift 
distortions, if not properly modeled, may lead to misinter- 
pretation when comparing theory with observation. 

A complete specification of galaxy clustering may only 
truly be given by the full set of galaxy N-point correlation 
functions. Such functions form the solution to the so-called 
BBGKY system of dynamical equations. This approach, pi- 
oneered in the 1970s by Peebles and co workers (Davis & 
Peebles 1977, Groth & Peebles 1977, Fry & Peebles 1978, 
Peebles 1980), has met with little success in practice. From a 
theoretical perspective, a closed solution to the BBGKY hi- 
erarchy has never been found and remains a thorny problem. 
Observationally, measurements of the correlation functions 
have been restricted to the lowest few orders. 

An alternative description, and the focus of this work, 
may be given by the Probability Distribution Function 
(PDF) of a random field. Strictly, we investigate the PDF at 
a single spatial location (the 1-point PDF), but in general 
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the PDFs are a family of N-point distribution functions for 
the probability at N positions in space. The random fields 
of interest are the density fields of both dark matter and 
galaxies. 

PDFs are useful in cosmology because, in principle, they 
encode much of the information contained within the full 
hierarchy of correlation functions, thus providing valuable 
information about gravitational evolution and initial condi- 
tions. Furthermore the discrete analogy of the 1-point PDF, 
the counts in cells statistic, is a relatively straightforward 
quantity to measure from galaxy surveys (Hamilton 1985, 
Alimi et al. 1990, Szapudi et al. 1992, 1996, Gaztanaga 1992, 
1994, Bouchet et al. 1993, Kim & Strauss 1998), allowing for 
easy comparison between theory and observation. 

The shape of the PDF is strongly influenced by the non- 
linear effects of gravitational instability. A cosmic field that 
is initially Gaussian random, the generic prediction of most 
inflationary models, will remain so only when its evolution 
is linear. When the evolution becomes nonlinear, the higher- 
order moments of the field (more correctly the cumulants) 
become non-zero for the first time resulting in a PDF that 
may be strongly skewed about its mean and cuspy around 
its peak. 

Attempts to describe theoretically this evolution have 
been numerous (Fry 1985, Coles & Jones 1991, Bernardeau 
1992, 1994, 1996, Bernardeau & Kofman 1995, Juszkiewicz 
et al. 1995, Colombi et al. 1997, Gaztanaga et al. 1999, Tay- 
lor & Watts 2000) with the result that, at least for quasi- 
linear evolution of the matter field, the behavior of the PDF 
is quite well understood. Since the density field of galaxies 
inferred from a galaxy redshift survey may not be a good 
representation of the underlying dark matter it is unlikely 
that the PDF of the dark matter density field is a realistic 
approximation for that of galaxies. Just as nonlinear grav- 
itational evolution drives the PDF away from Gaussianity, 
so too will nonlinear bias and redshift distortions. The chal- 
lenge is to separate out each of the effects and quantify the 
evolution of the PDF in terms of a few essential parameters. 

There have been significantly fewer attempts to calcu- 
late the evolution of the PDF in redshift space than in real 
space. Most notably, Hui et al. (2000) extended the work 
of Bernardeau & Kofman (1995) to calculate the evolution 
using the Zel'dovich approximation. Despite being an excel- 
lent approximation for the nonlinear gravitational dynamics 
Zel'dovich's solution does not produce a good fit to the PDF 
of N-body simulations. This is principally due to the forma- 
tion of caustics in the density field at shell crossing. The 
result for the Zel'dovich PDF is an asymptotic high density 
tail that falls off like in both real and redshift space. 
Tails like these are not observed in simulations. 

Another way of incorporating the effect of redshift 
distortions and bias is to use the Edgeworth expansion 
(Juszkiewicz et al. 1995, Bernardeau & Kofman 1995). In 
this approximation, the PDF is reconstructed from the mo- 
ments of the field which must be calculated separately us- 
ing perturbation theory. Such a calculation was done by 
Juszkiewicz et al. (1993; but see Hivon et al. 1995 for more 
details), who used second order Lagrangian perturbation 
theory (Bouchet et al. 1995) to map the skewness of an un- 
biased field into redshift space. They did not, however, try 
to use the skewness to evolve the PDF via the Edgeworth 
expansion. 



A problem with using the Edgeworth expansion is its 
tendency to produce unphysical features in the PDF. In par- 
ticular is the appearance of negative probabilities at low 
densities and the formation of unphysical "wiggles" in the 
high density tail of the distribution when the variance grows. 
These problems can be overcome by reconstructing the PDF 
using a Gamma expansion (Gaztanaga et al. 1999) which 
shows a markedly better agreement with N-body data in 
real space. 

In this paper we extend the formalism developed in a 
earlier work (Taylor & Watts 2000, henceforth TW2000) 
where we showed how the PDF transforms when the mat- 
ter density field is propagated to second-order. This method 
is based on an exact propagation of Gaussian initial proba- 
bilities using the Chapman-Kolmogorov equation from sta- 
tistical physics (e.g van Kampen 1992). In this paper we 
develop this approach to include the transformation to a 
local, second-order biased galaxy distribution, and apply 
second-order Eulerian perturbation theory to describe the 
mapping into redshift-space. We show how the method can 
be naturally extended to incorporate a stochastic (or hidden 
variable) bias, although we leave detailed calculations for a 
later paper (Watts & Taylor in preparation). Our calcula- 
tion also provides a new analytic solution for the skewness 
in redshift-space. This result is different to that of Hivon 
et al. (1995) who estimated the skewness by inverse trans- 
form of the bispectrum. However they where unable to find a 
closed-form solution, and did not include the effects of non- 
linear galaxy biasing. Here we derive a closed-form solution 
for the skewness, including second-order bias, and find that 
the nonlinear bias has a large effect. 

The layout of this paper is as follows. In §2 we discuss 
the derivation of the 1-point Probability Distribution Func- 
tion. The new result for the skewness, and a discussion of 
stochastic biasing may also be found here. In §3 we illustrate 
the various dependencies on cosmological parameters of the 
shape of the PDF. Our results are compared with N-body 
simulations and other approximations in §4, conclusions are 
given in §5. 



2 THE GALAXY DISTRIBUTION FUNCTION 
IN REDSHIFT-SPACE 

2.1 Redshift distortions and biasing in 
second-order perturbation theory. 

2.1.1 Second- order perturbation theory 

In Eulerian perturbation theory the density field S(x, t) at 
real space comoving position x and time t in a flat Universe 
can be expanded into a series of separable functions; 

oo oo 

5(x, t)=^ 5 ^{x, t) = ^2 D " (*) ), (i) 

n— 1 n — 1 

In curved space this expansion is only separable to second- 
order. The evolution of 5(x, t) can therefore be traced into 
the nonlinear regime by solving the fluid and Poisson's equa- 
tions for each order in the perturbation expansion (Peebles 
1980, Juszkiewicz 1981, Vishniac 1983, Fry 1984, Bouchet 
et al. 1992, Bouchet et al. 1995); 

8 = 5i+6 2 
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6 1 + ±(2- K )5 2 1 -r 1 .g+±(l + K)E 2 , 



(2) 
(3) 



where 

ri(x,t) = VSi(x,t), 

is the gradient of the linear density field and 

g(x,i) = -VV- 2 <5i(x,t), (4) 

is the linear peculiar gravity field^| where V -2 is the inverse 
Laplacian. The trace- free tidal tensor is given by 



Eij(x,t) = (ViVjV 2 - ^5ij)Si(x,t). 



2.1.2 Redshift space distortions 



(•>) 



To determine the redshift space density field one must con- 
sider the mapping from the real space comoving position to 
redshift space comoving position, s, given by (see the excel- 
lent review by Hamilton, 1998, for a full account) 



s = r + ru 



(6) 



where u = r-v(r,t)/ Ho is the projection of the velocity field 
along the line of sight. In the distant observer approximation 
(Kaiser 1987) the result can be written as a series in u and 
S, 



S s (s,t) 



(-«)" d n f8(s,t)-u' 



^— ' n! ds n \ 1 + w 

n = 



(7) 



where ' = d/dr and all quantities are evaluated in redshift- 
space coordinates. 

Expanding this to second order we find 



S B = S - u' - [u{5 - it')]' 



2.1.3 Galaxy bias 



(8) 



Next we assume that the density field of galaxies, smoothed 
on some scale R, is some local function of the underlying 
smoothed field of dark mass. Following Fry & Gaztanaga 
(1993) we expand the galaxy density in powers of 5; 



(0) 



The coefficients in this expansion, b n , are the bias param- 
eters. We make no assumptions about the biasing function 
other than that it is local and that it may be expanded 
in a Taylor series. This is a deterministic Eulerian biasing 
scheme, but can be generalised to a stochastic Eulerian bi- 
asing scheme to allow for the hidden effects of galaxy forma- 
tion (Pen 1998, Dekel & Lahav 1999). In Section |1| we show 
how our intrinsically probabilistic approach can be used to 
incorporate a stochastic Eularian bias. Other alternatives 
for bias are nonlocal Eulerian biases (Bower et al. 1993) and 
Lagrangian biasing schemes, which are intrinsically nonlo- 
cal in Eulerian coordinates. The latter is possibly the most 
natural to arise from following halos in the Press-Schechter 



In this paper we define 4-irGpo = 3/2QH 2 = 1 and the expan- 
sion parameter a(t) = 1, since our final distribution function will 
be dimcnsionless. 



approach to galaxy formation (Press & Schechter 1974) . Our 
approach is more phenomenological and we do not consider 
these possibilities here. 

Combining equations (Q) and (^|) gives to second order 
(Heavens et al. 1998) 



Sg = b\5 — u + — (<5 



0o J 



bi8u — b\5'u + uu" (10) 



where all quantities are evaluated at s. 

In order that this expression has the correct expectation 
value, (Sg) = 0, we have set &o = — &2O0 /2- Finally we rewrite 
equation ([To]) in terms of the linear density, gravity and tidal 
fields we shall use in later calculations; 



8 s g = 6g tl + 5 s a , 2 + A*,! + l 

where 

5"g t i = 6i5i, 

S g ,2 = M2 + y(<5i ~ cr 2 ), 

a:, = h {e zz + , 



A 



(11) 

(12) 
(13) 

(14) 



= Ag 2 ! +5g A A ga - ^-K, ( IlL + y 



+ y k ( n - + t) ~ fl9z {blVz + flFzzz) ■ (15) 

The first two terms in equation dll| ) deal with the non-linear 
evolution and second order bias, the last two terms bring in 
the effects of redshift distortions. 

The new dynamical variables required for redshift-space 
are the gradient of the tidal field 



Fij k {r,t) = ViVjV k V Si(r,t), 
and the second-order tidal fields 

n&(r,t) = (ViV,-V- 2 - iftj)*?(r,t), 
and 

ng(r,t) = (ViVjV- 2 - ^)E 2 (r,t). 



(16) 
(17) 
(18) 



Note that in writing equations ([n]) to ([li)) we have made the 
plane parallel approximation, evaluating all of the redshift 
space contributions along a single cartesian axis. 

The remaining parameters in the second-order model 
are (Bouchet et al. 1992, see also Catelan et al. 1995) 

— ^2 „ ,_ n -2/63 

k = w — 3/7S2 ' open universe 

^1 

w -3/7fi,7 1 1/143 flat universe (19) 



where Si(x, t) — D± (t)ei(x), and D±(t) w (1 + z)~ n is the 
linear growth function for an open model (Peebles 1980). 
The redshift-space parameters are (Peebles 1980, Lahav et 
al. 1991, Martel 1991) 

_ dln_Di 



din a 



f2 5/9 



and (Hivon et al. 1995) 
dlnD 2 



P. 



din a 



2fi 4 / 7 



open universe 
flat universe 

open universe 



(20) 
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2ft 



6/11 



flat universe 



(21) 



Note that our expression for the redshift distorted and 
biased galaxy distribution, equation (|ll"|), does not agree 
with Hivon et al. (1995), even taking into account their 
slightly different projection into redshift space. Hivons et 
al.'s expression does not appear to have the correct expec- 
tation value, i.e. their quantity (5 s ) 7^ 0. 



2.2 The distribution of initial fields 

Following the analysis of TW2000 we aim to find the joint 
probability for the fields in equation (|10|). Defining the pa- 
rameter vector y = (<5i, 77, g, E, H s zz , 11^, F zzz ), the initial 
distribution function is given by 



P(y) = 



exp 



((27t)™ I detC |)V2 
where the covariance matrix is 
C = (yy*). 



C~ x y 



(22) 



(23) 



Taking the plane-parallel approximation for redshift-space 
distortions represents a significant simplification since any 
dependence of Sg on the off-diagonal parts of Iff,, Tlfj and 
Fijk is removed, reducing the size of the required covariance 
matrix from 34 x 34 to 15 x 15. 

Equation (^2|) is an approximation. The variables II s 
and H E are not strictly Gaussian random fields as they are 
generated from the square of linear fields. However, our def- 
inition of these quantities as trace-free makes them uncor- 
rected with all of the other fields. If we then assume that 
their distribution is Gaussian, as we may expect them to be 
as a result of the Central Limit Theorem, they become sta- 
tistically independent Gaussian fields. To test our Gaussian 
approximation of the distribution of the Il-fields we gen- 
erated random Gaussian 5-fields and calculated H s zz . From 
this we estimated its distribution and found that a Gaus- 
sian with variance o 2 (Tl zz ) = 8<To/45 (see equation [m]) was 
indeed a good approximation. Later on, in Section 2.5, we 
shall see that these terms do not contribute to the skewness 
of the final distribution to lowest order. Hence the effects 
of this approximation will only be apparent in the higher 
moments and at higher order. 

The non-zero elements of the covariance matrix are 
given by. 



(8 2 ) 

{gig 3) 

{EijEki) 

<nLnL) 
(giFjki) 

{F zzz r) z } 



2 

a , 



(ViVi) = 3C1 Sij, 



iV° 2 (* 



kSji + SuSjk — —Sij Ski 

45 a ° < n « n "> = 135 a ° 
cro(SijSki + SikSji +SuSjk) 



15 



(F ZZZ F ZZZ ) — —a 2 



(24) 



Evaluating the action of equation (E2T) we find 



y'c- 



s + 



(1-7^) 



(71(7-1 



+ „ 
G 2 4(7 



63 / 



+ 



45 
8a 4 



with determinant 
22 



(n s zz y 



+ 



135 
22(7^ 



(nf z y 



The correlation parameter is defined by 



(25) 



(26) 



(27) 



which is the correlation coefficient of the velocity and gra- 
dient density fields. The variances are defined as 



<(fc) = D?(t) 



dk_ 

2^2 



k 



(28) 



where R a is the smoothing scale. For linear CDM power 
spectra 7„ is found to lie in the range 0.50 < j u < 0.65 
(TW2000) for a wide range of scales. TW2000 discuss the 
effect of varying y v and find that it is very small in the 
given range. We henceforth adopt the value 7„ = 0.55 for 
all subsequent plots. 



2.3 Propagation of the density distribution 
function 

The multivariate Gaussian distribution for initial fields is 
propagated to later times using the Chapman-Kolmogorov 
equation; 



daW{a\(3)P(a) 



(29) 



where W(a|/3) is the transition probability from the state a 
to (5. For a deterministic process, such as the gravitational 
evolution of a system from the linear to non-linear regimes, 
the transition probability is given by a Dirac delta function 



W{5 s g \y) = 8 D \5i - 8{ - S% — A" 



(30) 



The advantage of using the Chapman-Kolmogorov approach 
is that is generalizes the change of variables to allow for 
stochasticity in the transformation between initial and final 
states. This will be further discussed in §2.4. 

The probability distribution function for Sg can be writ- 
ten as an expectation over the stochastic variables y 



P(S s g ) = (5 D (5 s g -5t-5i-Ai-Ai)) v 

The Characteristic Function for P(8g), defined by 



Q{J) 



dSgP(Sg) exp (— iJSg^ 



and with inverse 



dJ 



P(5 S g )= / ^G(J)exp(iJS s g ) 

can be expressed as the expectation value 

Q(J) = <exp -iJ{8{ + <5| + Aj + A s 2 )) y . 

As in TW2000, equation @ reduces to a set of Gaus- 
sian type integrals which yield the probability generating 



(31) 



(32) 



(33) 



(34) 
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function, Q(J), for a smoothed galaxy density field in red- 
shift space; 



q{j) = rK i/2ex p 

2 J 2 a 2 f 2 



exp 



J 2 o 2 N 2 



201 



x exp 
x exp 



45 03 



fOi ) 



1 ,2 8 f 2 2 1379 

V a ° hK 1080 



(35) 



where 



1 + 2iJa 2 } A 
4 . 



02 = (l + jgiJo$B 



03 



4. l2n 8 . T 2 ,2 8 J 2 a£H 2 
1 + i5^ 5+ 45 lJ<To/l+ 45^r 



, 2 2, . 1 r 2 4, 2 (1 — ll) 



i t2 4 
+ J cr 



i-4^o( /i2 ^y M 

(1-7* 



M- 



9 7 2 



M - 



™A + L!L(a -^ 2 

15 J V + 21 7 2V 25 7 " 



(36) 



and 




A = 


i(2- K )& 1 ++i/ 1 2 + ib 1 / 1 -i J 


B = 


^(l + K)6i + i/a« 


H = 


/a (* + §A) 


M = 




N = 





(37) 

The 1-pt PDF can then be found by numerical integra- 
tion of equation (p^). Equations (|3fj) to (|37j) represent the 
main analytical result of the this paper. 



2.4 Stochastic Bias Schemes 

Instead of the local deterministic prescription for galaxy bi- 
asing described by equation (jt^) we may choose to implement 
a stochastic Eularian biasing scheme. In stochastic biasing 
(Pen 1998, Dekel & Lahav 1999) the relation between the un- 
derlying density field of dark matter and the galaxy density 
field is a random process with joint distribution P{S g , 8). 

If we assume that the biasing transition distribution, 
P(8 g , 8), is a bivariate Gaussian of the form 

P(8 9 ,8) 



2n[a 2 b +b 2 a 2 (l~r?)] 



x ex P~2 



1 S 2 + 8 2 b\{l + (J 2 Jb 2 al) - 2rb 1 8 g 8 



°l + b 2 o 2 {l 



(38) 



where the covariances between the fields are given to first 
order by 



{8g) = b\o%+o\, 
{S g S) = hra 2 



(39) 
(40) 



where <T(, is the variance of the random component deter- 
mining galaxy formation and r is the correlation coefficient 
between the mass and galaxy distribution, then the galaxy 
distribution function can be written 



P(S 3 ) = {P(8 g J(y))) y , 



(41) 



where f(y) — 81+82 in real space. The characteristic func- 
tion for galaxies is then 



(42) 



Gs g {J) = J dJ'G s (j')G Sg , d (J,j') 
where 

In G Sg , s (J, J') = -\(J 2 al + J' 2 (b 2 1 <Tl+<T 2 b ) + 2Jj'b 1 r<Tl){AZ) 



Equation ( J42| ) can be evaluated numerically. 

The distribution function of galaxies can also be evalu- 
ated by numerically integrating 



P{S B )= / ^G S (J)P(S 9 ,J) 

—00 

where 



P(8 g , J) = / dSe U5 P(8 g ,S) 



(44) 



(45) 



is the partially transformed characteristic function. We shall 
explore the effects of stochastic bias elsewhere, but note 
that it can be naturally incorporated into our probabilis- 
tic scheme. For the remainder of the paper we shall only 
discuss the deterministic scheme. 



2.5 Skewness of a biased density field in redshift 
space 

The skewness of the biased redshift-space galaxy distribu- 
tion can be found from the characteristic function, G(J), by 
differentiation. The moment parameters are defined as 

(He 



S n — 



(46) 



where the cumulants (or connected moments) of the field 
are given by 

^ = ^] lne(J - 0) - (47) 
In redshift space the skewness parameter becomes 



2 



where a s is the linear, redshift space variance of the galaxy 
density field, 5"„. 

Taking the third derivative and setting J — we find 
the redshift-space skewness parameter; 

SlbtF 2 = 46? + + 3&2 6f(l + f) 2 

_ 2k&?(1 + ~/3 2 ) (l + l/h + j^ft) 

+ 2fet/3 1 (l + ^/3 1 + |/3 1 2 )+|^ 1 (49) 

where F(f3\) is the linear redshift space enhancement factor 
for the redshifted variance given to lowest order by (Kaiser 
1987) 
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l + f/fc + 

3 5 



and 



fi 
bi- 



ll 
bi' 



h = t 1 - (32 
The variance in redshift space is just 



(50) 



(51) 



(52) 



Equation ([49|) for the galaxy skewness in redshift space is the 
second major result of this paper. It is interesting to com- 
pare this with the results of Hivon et al. (1995) who derived 
the density skewness in redshift space. Hivon et al. derived 
their result by transforming the redshifted bispectrum but 
did not find a closed form solution and were left with a term 
containing an integral that was to be determined numeri- 
cally. Our result for the skewness should coincide with that 
of Hivon et al. for the special case of b\ = 1 and &2 = 0, 
but does not. The differences between our result and that 
of Hivon et al. may result from the different approaches, 
or from the difference in initial expressions for 5 s g . We have 
not not been able to trace the origin of these differences. 
We have verified equation ( ]49| ) by a direct calculation of the 
redshifted galaxy skewness, using equation ( pi] ) along with 
the correlators in equation (^4j), to evaluate the quantity 
<(^) 3 >- 

Equation (W9) displays the hierarchical scaling associ- 
ated with galaxy clustering. It has already been shown that 
a local bias will retain a hierarchical form if the underlying 
field displays hierarchical behavior (Fry & Gastanaga 1993) , 
and that the lowest order of perturbation theory reproduces 
the hierarchical structure. Our result would seem to indi- 
cate that this structure holds for galaxies in redshift-space. 
This is an important result as it strengthens the claim that 
if galaxies display a hierarchical scaling then it is consis- 
tent that the underlying density field was initially Gaussian 
distributed and bias is local. Our results suggest that this 
can be extended as a test of galaxy distributions in redshift- 
space. 

In Figure 1 we plot the skewness parameter S3 as a 
function of linear bias, 61, for different values of il m and a 
quadratic bias 62 ■ An immediate result is that in all the mod- 
els the redshift-space skewness parameter, S3, is less than 
the real-space skewness parameter, S3, until 61 > 1.5. This 
may seem surprising as one expects the effects of redshift 
space distortions will tend to make the density field look 
more evolved along the line of sight. This does happen, but 
the second-order skewness is dominated by the first-order 
increase in the variance, so that the ratio ((3 s ) 3 } / ((3 s ) 2 } 2 
is smaller than its real space counterpart. This effect was 
previously observed by Hoyle et al. (2000). 

Beyond b\ « 1.5 the redshift skewness parameter rises 
above its real space value for all the models. This is because 
for large values of the linear bias parameter the second or- 
der skewness becomes comparable in magnitude to the red- 
shifted variance. 

For j3i ~ P2 ~ 1 and 61 « 1 the redshifted skewness can 
be approximated by 



which is accurate to a few percent. This can be compared 
with the undistorted skewness (Fry & Gaztanaga 1993) 



53 = ^1+3^ 



7 6 



(54) 
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Without second-order bias, SJ is only weakly dependent 
on f3i. The main effect of redshift-space distortions in this 
case is to change the dependence on b\. The second-order 
bias term, however, has a much stronger dependence on j3i, 
and has a lower amplitude than the undistorted term when 
Pi = 1. 



3 THE SHAPE OF THE PDF 

The solution for the generating function, G(J), given by 
equations (^H|) - (^) may be integrated numerically with- 
out too much difficulty to yield the full distribution function 
P(5). In figure 2 we show the shape of the resulting PDF 
for a range of values of the parameters 61, 62 and Sl m . 

To second order there are two important quantities 
working to distort the shape of the PDF from a Gaussian 
with linear mass variance ao- Firstly is the variance of the 
resulting field which acts to broaden the distribution. Sec- 
ondly is skewness, which produces asymmetry in the PDF 
about its mean at 5 = 0. These two quantities are effected to 
varying degrees by non-linear evolution, by non-linear bias 
and by the distorting effect of galaxy peculiar motions. 

For the plots in figure 2 we chose a moderate value for 
the linear variance of the underlying dark matter density 
field, o"o = 0.20. The correlation parameter, 7„, was 0.55. 
In the top panel we show the effect of a purely linear bias 
with values bi =0.7, 1.0 and 1.2 and with 62 ~ 0. For the 
lower panel the linear 61 's match those above but in this 
case 62 = 1-0 for every plot. Each plot shows three different 
PDFs: one (solid lines) in real space, one in redshift space 
for the case Qq = 1.0 (dotted lines) and a third in redshift 
space but with fio = 0.3 (dashed lines). We have inset the 
same plots on logarithmic axis in order to emphasize the 
tails of the distribution. 

In real space and with linear bias only, the shape of the 
PDF is dominated by the b\ boost to the variance. Where 
b\ > 1 the change to SJ is small and any effect on the 
shape of the PDF caused by the skewness is masked by the 
enhanced variance When 61 < 1 there is a more substantial 
change to S| but in this case the variance becomes small 
enough that the PDF looks relatively Gaussian regardless 
of what is happening to the skewness. 

To second order the non- linear bias parameter, 62, does 
not alter the variance of the underlying mass density field. 
The changes to the shape of the PDF when / then 
reflect only the corresponding changes to S3 We observe 
that the real space PDFs appear to peak sharply around 
5 = — la s with an abrupt drop off of the low density tail 
to the left of the peak. The high density tail is extended 
beyond the corresponding case where 62 = 0. With low 61 
the effect of &2 is most pronounced, causing an increasingly 
abrupt drop off in P(5) on the low density side of the peak. 
This pronounced effect on the PDF would appear to be an 
important signature of quadratic bias, which cannot be re- 
produced by any combination of other parameters. 

In redshift space the PDF is dominated by the enhance- 
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Figure 1. Log-log plot of the redshift-space galaxy skewness parameter, S3, as a function of H m , b\ and 62- The panels show increasing 
values of Q m = 0.1, 0.2, 0.7, 1.0. The x-axis is the linear bias parameter, bi. Lines going up are the quadratic bias parameter 62 = 0.0, 
0.5, 0.7, 1.0. Straight faint lines are the real space skewness, S3, and are independent of Q m . Darker curves are for the full redshifted 
skewness for an open (solid) and flat (dotted) universe. 



merit of the variance due to linear redshift space distortions. 
As discussed in §2.5 the second-order change in the skewness 
is less than the linear change in variance resulting in a lower 
skewness parameter, S3, in redshift-space than in real space. 
The result for the PDFs is a broader and less asymmetric 
appearance. This is most clearly seen in the PDFs where 
b'2 7^ 0. For these plots, the sharp peak in P(5) around 
S = — la s is significantly damped, and both the low and 
high density tails are much shallower than their real space 
counterparts. 



4 COMPARISON OF RESULTS WITH 
N-BODY SIMULATIONS 

4.1 The simulations 

In this section we illustrate how our theoretical PDF com- 
pares with that measured from cosmological N-body simula- 
tions. For the comparison we chose to use Hugh Couchman's 
Adaptive P 3 M N-body code (Couchman 1991) to model the 
evolution of the dark matter. The simulation volume was a 
cube of comoving side 200 h~ Mpc with periodic bound- 
ary conditions. We chose a CDM initial power spectrum 



(Bardeen at al. 1986) that was normalized to match the 
present day abundance of clusters. The variance on a scale 
of 8 h~ Mpc when the particle distribution was smoothed 
with top hat filters was as = 0.5 f!^ 0,5 in the final time-step 
of the simulation. The shape parameter used was F = 0.25, 
representing the best fit of the CDM power spectrum to 
galaxy clustering data. The simulation was performed on a 
128 3 Fourier mesh with 100 3 particles. 

We investigated the numerical PDFs in three sets of 
circumstances: 

• in real space with bias, 

• in redshift space without bias, 

• in redshift space with bias. 

The first two scenarios were trivial to construct from the 
simulations. For the case of bias in real space, we binned the 
data and estimated the overdensity from S = (n p — n p )/n p , 
where n p is the number of particles in a cell and n p is the 
mean number of particles per cell. The biased distribution 
was then estimated by the transformation 

*, = M + y(5 2 -^) (55) 
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Figure 2. The effect of linear and quadratic bias parameters on the 1-point PDF when the variance of the underlying mass density is 
(To = 0.2. From left to right the linear bias is b\ = 0.7, 1.0, 1.2. On the top row the second order bias parameter, 62 is 0.0 and on the 
bottom 1.0. In all plots a solid line represents the PDF in real space, a dashed line is for redshift space when Q = 0.3, and a dotted line 
for redshift space when Q = 1.0. 



where cto is the variance on the scale of the binning. 

Redshift distortions, in the absence of bias, were calcu- 
lated using the peculiar velocities of the simulation particles. 
The distortions were made plane parallel in order to match 
our approximation from §2.1. The resulting particle distri- 
bution was smoothed with Gaussian niters of radius R a . The 
PDF was then evaluated from the relative abundance of 5 
across the grid. 

Measuring the combined effect of redshift distortions 
and bias was more difficult. A simple combination of the 
above methods could not be used, as we needed to identify 
biased galaxies before making the transformation to red- 
shift space. Our basic biasing method calculates the biased 
density field on the grid so that information about indi- 
vidual galaxies was lost. In order to identify individual bi- 
ased galaxies in the simulation we sampled a population of 
galaxies from the simulation particles so that the number of 
galaxies in a cell, n g , was 



(56) 



The resulting distribution could then be transformed to 
redshift-space by using the galaxies velocity. 

The size of the grid was dictated by the need for n p 



to be sufficiently larger than n 9 so that underdense regions 
could be enlarged by the bias. Given this constraint we chose 
to smooth the particle distribution on a coarse mesh of be- 
tween 9 and 11 cells, with a cell size of / = 22/i~ 1 Mpc and 
18ft _1 Mpc, respectively. As we calculate the linear variance 
using a Gaussian filter (equation ^) this corresponds to a 
smoothing scale of Rs — l/\/T2 = 6.5 and 5/t _1 Mpc. The 
total number of galaxies was constrained by the requirement 
that n g was not larger than n v in areas of high density, so we 
set the total number in the simulation volume N g = 5 x 10 s . 

The errors on the numerical PDFs for the biased and 
distorted only simulations were taken from the standard de- 
viation over 5 independent realisations of the simulated vol- 
ume. Errors due to shot noise were incorporated into the 
theoretical PDF (TW2000) based on the mean density of 
particles in a cubical cell of side I = \/T2R s . In practice the 
shot noise contribution was small for simulations due to the 
high particle density. 

In the case of a combined biased and redshift-space dis- 
torted PDFs the need to sample the galaxies on a coarser 
mesh meant that the resulting PDF from an individual sim- 
ulation had a larger scatter from simulation to simulation 
for the same value of o 2 a . To avoid this we used a simulation 
with a smaller clustering variance. A relatively large number 
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Figure 3. Comparison with N-body simulation in real space when cr s = 0.20 and with linear bias 61 = 0.7 (top panel) and 61 = 1.5 
(bottom panel). The solid line is the PDF for this work, the dotted line shows the reconstruction of the PDF from the Edgeworth 
expansion with the skewness parameter from §2. Filled squares represent the simulation data. The linear mass variance was <tq = 0.29 
and 0.13 for the top and bottom plots respectively. 



of particles could then to be found in each cell. For the case 
R s ~ 5/i -1 Mpc there numbered on average 375 galaxies per 
cell sampled from around 750 simulation particles. When the 
numerical PDFs were averaged over 7 independent realisa- 
tions the results were reasonably smooth, though the error 
bars (again the standard deviation over the 7 realisations) 
do reflect the scatter. 



4.2 The Edgeworth Expansion 

In addition to the PDF calculated from this work we also 
examine the PDF reconstructed from the Edgeworth expan- 
sion (Juszkiewicz et al. 1995, Bernardeau & Kofman 1995) 
using the redshift-space skewness parameter derived in Sec- 
tion 2.5. The distribution function from the Edgeworth ex- 
pansion is, to second order, 



P{y) 



1 + yH 3 {y)S' 3 







exp 


. 2 . 



(57) 



where is the redshift-space galaxy variance given by equa- 
tion (|52[), S3 is the skewness parameter from equation ([49]), 
H n {y) is an n th -order Hermite polynomial and v = 8 s /a s is 
the scaled redshift-space density field. 

The main difference between the Edgeworth expansion 
and our own model for the PDF is that the Edgeworth re- 
lies upon two levels of approximation: firstly in the moments, 
which come from — in our case — a second order perturba- 
tive calculation; and secondly in the Edgeworth series itself, 
which reconstructs the non-linear PDF by expanding about 
a Gaussian distribution. Our PDF comes directly from the 
perturbation theory without any further constraints other 
than those implicit in the theory itself. In this way our 
model is perhaps more representative of the perturbation 
theory and its predictions and limitations. Another dissim- 
ilarity between the models, and a problem with trying to 
derive the PDF to second order, lies in the fact that second 
order perturbation theory does not get the higher moments 
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Figure 4. Comparison with N-body simulation in real space when cr s = 0.20 and with linear bias and second order bias. In the top 
panel b\ = 0.7 while in the lower panel b\ = 1.0 In both cases the second order bias parameter 62 = 1.0. The solid line is the PDF for 
this work, the dotted line shows the reconstruction of the PDF from the Edgcworth expansion with the skewness from §2. Filled squares 
represent the simulation data. The linear mass variance was ctq = 0.29 and 0.13 for the top and bottom plots respectively. 



of the distribution correct. In our calculation all of the higher 
moments exist but have terms missing that would come in 
in a higher order calculation. This may strongly effect the 
shape of our PDF, particularly around the peak. Such prob- 
lems are not an issue for the Edgeworth and other similar 
approximations because in these models the cumulants are 
either put into the series explicitly, or else are zero. 



4.3 Results 

In comparing the models with simulation data we con- 
structed the PDF so that the variance of the distorted, bi- 
ased density field, a 2 3 , was constant for all choices of the 
parameters Q, bi and 62. This was arranged by using an ap- 
propriate value for 00 for each plot. For the case of bias in 
real space and of no bias in redshift space we chose a s — 0.2. 
This relatively moderate variance was necessary because for 
the cases where bi < 1 the underlying ao had to start quite 



large. For the combination of bias and redshift distortions we 
were limited to starting with a slightly higher linear mass 
variance because the simulation PDFs became dominated 
by sampling variance when the cell size was large. For these 
plots a s — 0.25. 

Figures 3 and 4 show the behavior of the PDF with 
various combinations of the bias parameters and with no 
redshift space distortions. The density parameter for these 
simulations was £1 — 1.0, though in real space the quasi- 
linear evolution is not sensitive to this quantity (Bouchet et 
al. 1992, Martel & Freudling 1991). 

The first set of plots (figure 3) show the effect of a lin- 
ear bias only, 61 = 0.7 in the top panel and 61 = 1.5 in the 
bottom. The linear mass variance was 00 = 0.29 and 0.13 
respectively. The dominant effect on the shape is through 
the variance which tends to broaden and lower the ampli- 
tude PDF for 61 > 1 and conversely makes the PDF more 
Gaussian when bi. < 1. The fit to the data (points) of the 
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Figure 5. Comparison with N-body simulations in redshift space for f!o = 1-0 an d ^0 = 0.3 when a s = 0.2. The bias parameters were 
in both cases b\ = 1.0 and 62 = 0, i.c no bias. The solid line is the PDF for this work, the dotted line shows the reconstruction of the 
PDF from the Edgcworth expansion with the skewness parameter from §2. Filled squared represents the simulation data. The linear 
mass variance was ao = 0.15 and 0.17 for the top and bottom plots respectively. 



PDF from this work is good in both cases, though when 
do is relatively large our model misses the peak of the dis- 
tribution and appears to be slightly too strongly skewed. 
The Edgeworth approximation (dotted lines) also shows ex- 
cellent agreement with the simulations, particularly in the 
vicinity of the peak. The plots show the absolute value of 
the Edgeworth PDF. The "lobes" in the low density tail of 
the distribution represent negative probabilities in the Edge- 
worth PDF. 

In figure 4 we show the fit to simulations when the sec- 
ond order bias term, 62, is TO. The linear bias was set to 
bi = 0.7 (top) and 61 = TO (bottom). The linear mass 
variance was ao = 0.29 (top) and ao — 0.20 (bottom). To 
the order we are interested in &2 only contributes to the 
PDF via £3 and the higher moments. The effect of the sec- 
ond order bias is to fill in the void regions and enhance the 
peaks. The result for the PDF is a sharp drop-off of the low 
density tail and a dramatic amplification of the peak just 



beyond that point. The high density tail is also extended 
to account for the increased regions of very high density. 
The fit to the simulations for second order biasing is good 
so long as ao is reasonably small, though still in the quasi- 
linear regime. The breakdown in the fit comes mainly in 
the low density tail which for the theoretical curves drops 
away too slowly when the variance is high. The Edgeworth 
PDF breaks down quite badly in the tails when &2 7^ 0. This 
suggests that the Edgeworth approximation becomes rather 
unstable when the skewness parameter is large, even when 
the variance is low. The peak of the Edgeworth PDF is not 
so badly affected, however, and fits the data nicely so long 
as ao is not too high. 

Figure 5 shows the fit of the PDF to an unbiased field 
of galaxies in redshift space. We ran two sets of simulations 
with Qq = 1.0 and 0.3. The linear variance used for the 
theoretical PDFs was a = 0.15 and 0.17. The fit to both 
our model and the Edgeworth PDF is very good, although 
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Figure 6. Comparison with N-body simulations showing the combined effect of bias and redshift distortions for Qq = 1.0 and when 
<j s = 0.25. The bias parameters were b\ = 1.2 and 62 = in the upper panel and b\ = 1.0, 62 = 1.0 in the lower. The solid line is the 
PDF for this work, the dotted line shows the reconstruction of the PDF from the Edgeworth expansion with the skewness parameter 
from §2. Filled squared represents the simulation data. The linear mass variance was <ro = 0.16 and 0.19 for the top and bottom plots 
respectively. 



the linear variance in each case was quite small to ensure 
that a s — 0.2. We found that our approximation began to 
break down when the redshift space variance was <j s — 0.4. 
Although the Edgeworth followed the peak and low density 
part of the distribution well to this variance and higher, 
its high density tail became unstable at a much lower a s . 
In both cases the PDFs look more Gaussian than in real 
space because of the reduction in S3. The theoretical plots 
are almost identical in each case, as expected since we have 
arranged for the variance of the redshifted field to be iden- 
tical in each case and anyway the skewness in redshift space 
is only very weakly dependant upon fi m when bi is close 
to unity. The numerical PDFs are marginally different, al- 
though we suspect that this is a numerical effect as suggested 
by the large error bars around the peak of the distribution 
for Q, m = 0.3. 

On much smaller scales we would expect there to be 
a real difference between the two PDFs as fingers of god, 



due to the pairwise velocities in clusters, become important. 
Our second-order analysis does not allow for these strongly 
nonlinear effects, and we restrict our analysis to scales large 
enough that these effects are not important. The agreement 
between theory and the simulations suggests that this is 
true. The effect we would expect to see would be a decrease 
in the variance and skewness of the PDFs in redshift-space. 

Our final plots (figure 6) show the combined effect of 
both bias and redshift distortions. In the top row b\ = 1.2 
with &2 = 0.0 and on the bottom we have set b\ = 1.0 and 
made 62 = 1.0. For both cases the density parameter was 
fi m = 1.0. In each of the plots a a = 0.25 — due to problems 
with creating the bias/redshift simulations we were unable 
to satisfactorily measure the PDFs for a s — 0.2. The lin- 
ear mass variance was therefore exo = 0.18 in the top plot 
and 0.19 in the bottom. The fit in both cases is very good 
— although the Edgeworth approximation becomes slightly 
unstable when &2 = 1.0 because of the relatively high skew- 
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ness. Although the redshift distortions do wash out some 
of the effect of the bias parameters, there are dependencies 
which are reproduced by the models to a reasonable degree 
of accuracy. This is encouraging for the purposes of con- 
straining cosmological models using the PDF and data from 
galaxy redshift surveys. 

In general there is a good similarity between all of the 
numerical PDFs and those found from either our analysis 
based on the Chapman-Kolmogorov equation or the Edge- 
worth approximation with the redshift space skewness pa- 
rameter S3. The Edgeworth expansion clearly breaks down 
sooner in the tails of the distribution, and for large values of 
the skewness parameter. Considering that both models use 
second order perturbation theory and have the same skew- 
ness and variance it is perhaps confusing that they behave 
so differently with various combinations of the parameters. 
However we must not forget that the Edgeworth approxi- 
mation and our model rely on very different approaches to 
obtain the PDF as discussed in 84. 2. 




4.4 Constant f3 PDFs 

An interesting and potentially useful feature of modeling the 
PDF in redshift space comes when one considers parameter 
combinations that are degenerate to linear order. The com- 
bination 



Pi = 



(58) 



for example, is a common quantity to measure in linear anal- 
ysis of galaxy redshift surveys (see e.g. Tadros et al. 1999 
for a recent measurement from the PSCz redshift survey). 

We constructed the PDFs of two fields sharing the same 
f3i but with different combinations of the cosmological den- 
sity and linear bias parameters, neglecting the effects of sec- 
ond order bias. We chose /3i = 0.5 with Q. m = 0.3 and 
bi — 1.0 for one model and Q m = 1.0 with bi = 2.0 for the 
other. The linear mass variance for each was inferred from 
the observed abundance of clusters from (Viarma & Liddle 
1996) 



<r(R t , 



where 



cr 8 



Rt 



- 7 (H) 



7 = (0.3r + 0.2) 



2.92 + log 



Rt, 



Sft-iMpc 



(59) 



(60) 



and with V the CDM shape function. The abundance of 
clusters was parameterised by 



as = 0.6fi ' 



(61) 



where C « 0.49 for the closed and C « 0.43 for the open 
models. The underlying linear mass variance was found for 
each at constant top-hat filter radius, Rth- The resulting 
PDFs are shown in figure 6, the solid curve being the high- 
Q. model and the dashed line the low-51 model. Clearly there 
is a marked difference between the two PDFs, a result of the 
different ways in which bi and Q. contribute to the variance 
and the skewness. We investigate elsewhere whether this dif- 
ference can be used to constrain cosmological parameters. 
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Figure 7. Theoretical PDFs for a biased galaxy field in redshift 
space. The parameters are such that the combination (i = 0.5. 
For the solid curve f2o = 1.0 with bi = 2.0 and bi = whereas 
for the dotted curve fio = 0.3 with b\ = 1.0 and bi = 0. In both 
cases the filter was a Gaussian of radius 10 hr IMpc. 



5 SUMMARY 

In this paper we have derived an expression for the nonlin- 
ear 1-point probability distribution of galaxies in redshift 
space. This function is useful in the analysis of redshift 
surveys as it is a quantity that can be directly observed. 
We have treated the nonlinearity of the density field with 
second-order Eulerian perturbation theory, and transformed 
from the mass-density distribution to a galaxy distribution 
using a second-order local bias prescription. This is then 
mapped to redshift space, again using second-order pertur- 
bation theory. The transformation of the initial probability 
distribution to the evolved distribution is carried out us- 
ing the Chapman-Kolmogorov equation. This allows us to 
derive an exact expression for the second-order characteris- 
tic function for the galaxy distribution. We show that the 
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Chapman-Kolmogorov equation is more general than just a 
transformation of variables, as we can use it to calculate the 
effects of Stochastic Bias Schemes. We shall investigate this 
in more detail elsewhere. 

Taking the derivative of the galaxy characteristic func- 
tion we derive a new expression for the skewness parameter 
of galaxies in redshift space, S3. Unlike other derivations, 
we find a closed-form expression and include the effects of a 
quadratic bias term. We find that in general for values of the 
linear bias parameter 61 below 1.5, Sf is smaller than its real 
space value, S3. We also find that, while the first-order bias 
terms are largely independent of the linear distortion param- 
eter /Ji, or S7 m , for values of b\ around unity, as suggested 
by previous studies, the quadratic bias terms introduce a 
strong dependency on cosmological parameters. An analysis 
of the full 1-pt PDF in real and redshift space confirms these 
findings. In addition we find that quadratic bias produces a 
distinct sharp cut-of in the PDF for low-density regions. 

Comparing our PDF with those measured from N-body 
simulations we find good agreement for all combinations of 
parameters. We also find that the Edgeworth series, using 
our expression for the redshift-space skewness parameter, 
fits the numerical PDFs rather well when the series is trun- 
cated to second order and the redshift space skewness and 
variance are used. The models, particularly the Edgeworth 
approximation, fit the data most poorly when the linear 
mass variance and the skewness are both high {a a > 0.4). 
This occurs on small scales or where there is a large degree 
of non-linear bias. 

Our analysis leaves two problems unresolved. The first 
is the issue of fingers-of-god, which we do not model. This 
restricts the scales on which we can model the PDF, but does 
not seem to be a problem when we compare our results to 
simulations. We will test elsewhere if this is a problem when 
we come to use the PDF to extract cosmological parameters. 
The second issue is that of smoothing the final density field. 
We neglect this in our analysis, although it is included in the 
analysis of other PDFs (see e.g. Bernardeau 1994). Again, 
when comparing with the results of simulations we do not 
see a significant effect. This may be a result of the type of 
power spectra we have tested our model against. The effect 
of smoothing is to transfer power across scales and for some 
power spectra this transfer will be minimal. However for 
the range of spectra we have investigated we only require 
the variance to be correctly calibrated against simulations 
and we find good agreement. An advantage of the PDF over 
other statistical measures is that it is straightforward to see 
if the fit is poor. 

In future work the PDF we have derived will be com- 
pared with the PDF of galaxies measured from the PSCz 
galaxy survey (Watts & Taylor in preparation) . This will be 
useful both in testing the gravitational instability hypothe- 
sis and in constraining cosmological models, using data from 
the nonlinear regime of the galaxy distribution. This step is 
vital for maximising the amount of information available 
from galaxy redshift surveys. Although we find that redshift 
distortions do dampen some of the effects of bias, there are 
dependencies that may be exploited. We have shown how 
using the PDF in conjunction with the abundance of clus- 
ters may provide a way to distinguish between cosmological 
models that are degenerate in linear analysis. 
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